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METHOD AND SYSTEM FOR PROVIDING CACHE SET SELECTION WHICH IS 

POWER OPTIMIZED 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to instruction and data caches used in micro 
processor based computing systems for temporarily storing instructions and data. 
Specifically, a system and method are described which can address the associative 
cache in a high speed mode, or in a mode which conserves power. 

[0002] Computing systems, including micro-processor based systems, use a 
cache in conjunction with a main memory to hold data and/or instructions which are 
being processed. The cache comprises a memory where the temporary contents 
needed for processing are maintained so that the most recentiy used data from a main 
memory is located in a cache memory for rapid access by the micro-processor system. 

[0003] Cache memories are organized as set associative comprising sets of 
individual SRAMs which contain the desired data and which typically have conmion 
address lines. Each SRAM is referred to as a way, and in a two-way associative 
cache, conmion address lines are connected to each SRAM. Lines of multi byte data 
are stored in each location of the ways. The information contained within a line of the 
set associative cache memory is derived by an effective address 20 generated by the 
microprocessor system. The effective address includes a tag field, a line index field 
and a byte field. The tag field of an effective address is utilized to determine whether 
or not one or the other ways contains the data being sought. 
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[0004] Both ways may be simultaneously addressed, and data from one or the 
other of the ways may be selected by a multiplexer by comparing a tag derived from 
the address applied to the ways of the associative cache to a tag contained in a tag 
memory or directory. The tag memory includes a row of tag data corresponding to 
the same row number of data in a given way. Thus, a comparison between the 
contents of a row of a tag memory and a tag from the tag memory determines which 
way contains the desired data and a multiplexer selects the desired data from the 
identified way. 

[0005] In small computing systems, power efficiency becomes more important 
than was previously the case in earlier applications of set associative cache memories. 
Associative cache memories provide for higher speed data access when both ways are 
simultaneously addressed and cloclced, and a late select command to the multiplexer 
selects the data from one of the ways. While this provides for optimum access speed, 
power is dissipated in each of the SRAMs of the associative cache when only one 
SRAM contains the selected data. This represents a significant waste of operational 
power, particularly in battery operated devices such as cellular telephones which may 
use such microprocessor systems. 

[0006] To avoid the needless consumption of power by the way which does 
contain the desired data, some set associative cache memories have been provided 
with prediction logic. These systems all provide for a prediction of which way 
contains the requested data, and enable only the predicted way to produce the data. 
However, the prediction logic consumes power, and does not guarantee 100% 
predictability. Accordingly, more cache misses occur on a false prediction with only 
a marginal savings in power consumption. 
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[0007] In order to reduce power consumption, some designs reduce voltage 
levels or the operating frequency of the access cycle. There are limitations, however, 
to these techniques particularly lowering the operating frequency, since providing 
adequate time to make a set decision, and then obtain the required data, mandates a 
reduced maximum frequency of operation. 

[0008] In a paper entitled, "A 600MHz Single Chip Multiprocessor With 
4.8GB/S Internal Shared Piplined bus and 512kB Internal Memory", 2003 
International Solid-State Circuits Conferencerpg. 254, a set associative instruction 
cache is described having reduced power consumption for normal prefetch cycles. 
Tag memory access and data memory access are divided into two consecutive cycles 
and only one way is activated. On the other hand during branch conditions, tag 
memory access and data memory access of both ways are executed in the same cycle 
to enhance the performance. In this way, there are two variations of cache 
performance, one emphasizing low power and the other high performance. However, 
the trade off between power savings and obtaining higher access speed is limited to 
normal prefetch and branch conditions. Further, the access during normal prefetch 
operations are made over two cycles which significantly slows down the access 
process. Accordingly, it would be desirable to have a system which can be morphed 
between high performance, as well as a lower power consumption, in a variety of 
applications. 
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BRIEF SUMMARY OF THE INVENTION 

[0009] In accordance with the invention, a system and method are provided for 
accessing an associative data cache having at least two ways for storing data. First 
and second tag memories are provided for storing first and second sets of tags 
identifying each line of data stored in the ways of the cache memory. A translation 
device determines from the system address a tag identifying one of the ways obtaining 
data. First and second comparators compare the tag to a tag stored in first and second 
respective tag memories having the same address as the address applied to the first 
and second ways. When one of the comparator generates a signal, a multiplexer 
selects the corresponding way having the desired data. A clock signal circuit receives 
an access mode signal which controls whether one, both, or neither of the ways are 
clocked, depending on whether a power efficiency mode, or high access speed mode, 
is to be employed. 

[0010] In accordance with a preferred embodiment of the invention, the system 
can be selectively operated at optimum access speed or at optimum power efficiency. 
The power efficiency mode may be implemented by detecting the access cycle 
frequency, and when the cycle time is adequate to allow a serial access of the cache 
way which contains the desired data, the system accesses at most only one way. 
Further, when the access frequency is higher, demanding a high access speed, both 
ways are clocked and a late select signal is used to select the access data from one the 
way containing the desired data. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] FIG. 1 illustrates a first embodiment of the invention for controlling the 
access mode of an associative cache memory; 

[0013] FIG. 2 illustrates the timing of the circuit of Fig. 1 when the power 
efficiency mode is selected; 

[0014] FIG. 3 illustrates the timing of the circuit of Fig. 1 when the high speed 
access mode is selected; 

[0015] FIG. 4 illustrates a logic diagram of the access control circuitry of Fig. 1; 

and 

[0016] FIG. 5 illustrates the selection of a mode of operation based on power 
and frequency of the cache memory access. 

DESCRIPTION OF THE PREFERRED ElVlBODIMENT 

[0017] Referring now to Fig. 1, a two way associative cache 21 is shown. Way 
0, and way 1 comprise two SRAMs which produce first and second data outputs to a 
select multiplexer 25. The data cache memory 21 stores instructions, or other data 
necessary for a microprocessing system. The specific use of the data is not relevant to 
the invention's operation. 

[0018] The cache 21 is accessed firom a register containing an effective address 
19. The effective address contains data representing a tag associated with each line of 
memory in way 0 and way 1, a line index and a byte number. The Line Index portion 
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of the effective address identifies which line of tiie ways is to be accessed, and the 
byte identification data permits selection of a particular byte contained in a line of 
data. 

[0019] For each line or row of data contained in way 0 and way 1, a tag in a 
corresponding row of tag memories 22 is provided. The Line Index addresses both 
tag memories and both ways 0, 1. The tag data stored in tag memories 22 identified 
by the Line Index is compared with the effective address 19 tag data to determine 
which way contains data defined by effective address 19. The tag memory producing 
a tag which corresponds to the tag in effective address 19 corresponds to the way 
containing the desired data. In the event that neither tag memory 22 produces a 
corresponding tag, a cache miss occurs and the data must be recovered from the 
system main memory. 

[0020] Using a conventional translational look aside buffer 23, the tag 
information from address 19 can be reduced to an effective page number and a real 
page number corresponding to the tag associated with a row of memory in way 0, and 
way 1. Comparators 24 and 25 identify one of ways of the cache memory array 21 
when one of the tag memories 22 produces a tag corresponding to the tag obtained 
from the effective address 19. 

[0021] The cache memory array may be operated in either a power efficiency 
access mode, or in a high speed access mode. The power efficiency access mode is 
illustrated in Fig. 2. Clock pulses CLKO or CLKl are applied to only one of the ways. 
The selection of which way is to be clocked to produce data is made based on the 
determination of which way includes the data defined by effective address 19. When 
the access cycle is slow enough to permit a single way to be clocked, and data is 
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produced prior to the end of the access cycle, only one of the ways (shown as way 0 
in Fig. 2) is clocked if a hit is determined. A further selection of an individual byte in 
the data selected by multiplexer 25 is possible by the select circuit 30 in accordance 
with the byte data contained in the effective address 19. 

[0022] As is conventional in other associative cache memory systems, a miss or 
hit indication is produced by, respectively, NAND gate 27, or OR gate 28 so that the 
system can confirm that the data being sought was available, or was not available 
requiring the system to recover the data from system memory. 

[0023] When the microcomputing system is running an application which 
permits the power efficiency mode to operate, based on an assessment that sufficient 
cycle time is available to permit identification of the way containing the data defined 
by the line index of the effective address 19, only one of the ways 21 is clocked if a 
hit is determined. 

[0024] The high speed access mode which is used when the access cycle is 
reduced may be selected by clocking both ways of the cache memory 21. With the 
access mode set in the high speed access mode, both ways, way 0, way 1 are clocked 
as shown in Fig. 3, and a tag comparison is made with comparators 24. A late select 
signal is generated to select one or the other data outputs from way 0, or way 1 at the 
end of the access cycle. The byte select circuit 30 further distinguishes which byte or 
bytes of a line contained in the selected way is to be selected. 

[0025] An arrangement for providing clock circuit 20 is shown in Fig. 4. 
Referring now to Fig. 4, an access mode indication is applied to inverter 33. When 
the high speed access mode is selected, Al^JD gates 36 and 37 are enabled in response 
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to each received clock pulse to provide clock 0 and clock 1 signals to each of the 
ways of cache memory 21. When the high power efficiency mode is selected, 
comparators 24 A and 24B enable either AND gate 36 or 37, through OR gates 34 or 
35, thus providing clocking signals to only one way of the cache memory. 

[0026] The access mode may be controlled by system parameters, such as 
operating frequency, so that the determination can be made when the high power 
efficiency mode may be entered, or when the higher speed access mode is necessary 
for acquiring data in the cache memory 21. Fig. 5 illustrates the theoretical power 
consumption in both modes of operation of the circuit of Fig. 1. As can be seen from 
Fig. 5, during the power efficiency access mode, which is selected to occur when the 
frequency of data access is below a maximum frequency/2, significant power 
consumption efficiencies are obtained. Using the high speed access mode, where a 
late select is used to select one of the two way set associative cache memories, 
approximately 50% of the total power consumed is used in accessing the pair of ways. 
Accordingly, switching to the power efficiency mode lowers the total micro- 
processing system power consumption by 25%. 

[0027] Accordingly, the system can be morphed from the high access speed 
mode, sacrificing power efficiency, to the lower access speed power efficient mode. 
Changing access mode between power efficiency and high speed access can be 
implemented by determining the required access time. Control over the access mode 
can be either by software, based on parameters of the application being run in the 
microcomputer system, or manually implemented when it is known that a given micro 
processing system is dedicated to a particular application. 
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[0028] While the foregoing system has been disclosed to select between either 
one way activation, or two way, late select activation, is to be understood that the 
system could also be used to select between the prediction access mode and/or the late 
select, high speed operation. In this way, those applications requiring high speed 
access, both ways may be accessed with a late select feature, and in those applications 
where power is to be conserved, the prediction mode may be entered and used to 
select which way contains the requested data. 

[0029] The foregoing description of the invention illustrates and describes the 
present invention. Additionally, the disclosure shows and describes only the preferred 
embodiments of the invention in the context of a power optimized cache set selection, 
but, as mentioned above, it is to be understood that the invention is capable of use in 
various other combinations, modifications, and environments and is capable of 
changes or modifications within the scope of the inventive concept as expressed 
herein, conmiensurate with the above teachings and/or tiie skill or knowledge of the 
relevant art. The embodiments described hereinabove are further intended to explain 
best modes known of practicing the invention and to enable others skilled in the art to 
utiUze the invention in such, or other, embodiments and with the various 
modifications required by the particular applications or uses of the invention. 
Accordingly, the description is not intended to limit the invention to the form or 
application disclosed herein. Also, it is intended that the appended claims be 
construed to include alternative embodiments. 
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